Supplementary Figure 1 



Sequencer model 
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Overview of the properties of the 1,262 samples used for eQTL and ASE analyses 



Here we show that the samples which we successfully genotyped and used for the eQTL and 
ASE analysis still show high heterogeneity in sequencer models (a), read layout (b), sampled 
tissue (c), cancer status (d), total number of reads (e), read length (f). 



Supplementary Figure 2 
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Sequenced number of bases vs number of 
high quality genotype calls of 218 hematological sampl 



Single end 
□ Paired end 



r~ 

1x10 



9 * •• • 



1 1 1 I I I I | 



1 1 I I I I 



8 



1x10 



9 



1x10 



10 



1 — I — I I I I I I 

1x10 



Total number of sequenced bases 



11 



b 



Sequenced number of bases of the 1,262 
samples used for eQTL and ASE analyses 
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The relation between sequencing depth and the number of high quality genotypes. 

a) We observe a strong relation between the number of sequenced bases and the number of 
high quality genotypes, we do not observe that paired end sequencing improves genotyping. 

b) We observe that newer samples usually have more bases sequenced. 



Supplementary Figure 3 

Figure provided separately. 

Overview of genotyping accuracy and gene-expression over all chromosomes. 



Supplementary Figure 4 
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Median gene expression level relationship with 
genotyping concordance in all Geuvadis samples 
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□ Genotype concordance of directly genotyped SNPs before imputation 

□ Genotype concordance of directly genotyped SNPs after imputation 

□ Genotype concordance of SNPs that have only been imputed 
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Median gene expression level relationship with 
genotyping concordance in European Geuvadis samples 
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□ Genotype concordance of directly genotyped SNPs before imputation 

□ Genotype concordance of directly genotyped SNPs after imputation 

□ Genotype concordance of SNPs that have only been imputed 



Relation between gene-expression levels and genotype concordance before and after 
imputation. 

Genotype concordances in all Geuvadis samples (a) and European Geuvadis samples (b) of 
common SNPs (MAF > 0.05, DR2 > 0.8) before and after imputation grouped by the median 
expression levels of their genes. 



Supplementary Figure 5 
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Brain specific eQTL genes 
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Wilcoxon p-value: 2.93 x 10 
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Liver specific eQTL genes 
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Bladder specific eQTL genes 
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Breast specific eQTL genes 
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Expression of tissue-specific c/s-eQTL genes vs Geuvadis expression. 



We find that genes with tissue specific c/s-eQTLs are more abundantly expressed in the 
respective tissues compared to the Geuvadis samples in which we did not observe the cis- 
eQTLs. 



Supplementary Figure 6 
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GSTM5 expression in different tissues 
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DDTL expression in different tissues 
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PSCA expression in different tissues 










r 



i r~ 



— r 



T 



9b? 




T 



T 



n 



0 



Expression of example tissue-specific eQTL in different tissues. 

Here we show three example tissue-specific eQTL genes: a) GSTM5, brain-specific, b) 
DDTL, liver-specific, c) PSCA, bladder specific. 



Supplementary Figure 7 



a 



b 



CD 
CD 

05 
CD 

= 100 



05 
E 

00 

LU 



CD 

5 



CD 

CO 

CD 
CD 

05 

C 

CD 
O 

CD 



■a 

05 
E 

00 

LU 



80 



60 



05 
CD 

c5 

CD 
CD 
05 

C 

CD 
O 

CD 20 



40 



0 



100 



80 



60 



40 



20 



0 



Expected functional impact 



Wilcoxon p-value: 7.52x10 
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Predicted functional impact of ASE variants. 



The annotation of ASE SNPs predicted impact and effect was performed using SnpEff. (a) 
Most of the high-impact SNPs have lower expression of the alternative allele, (b) The majority 
of the SNPs introducing a stop codon have lower expression of the alternative allele. 



Supplementary Figure 8 
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MASP2 expression in different tissues 
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IRF4 expression in different tissues 
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Expression of MASP2 gene is liver-specific and the expression of IRF4 gene is 
hematopoietic-specific. 

(a) MASP2 gene has higher expression in liver compared to other tissues, (b) IRF4 gene has 
higher expression in hematopoietic cells compared to other tissues. 



Supplementary Figure 9 

Sample exclusion using PCA on gene expression levels 
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521 excluded samples: 

- 3 DNA-seq samples 

- 312 single cell experiments 

- 97 HLA targeted samples 
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Component 1 



PCA on expression values shows strong outliers that are removed from the analysis. 



The 521 outliers of the first component (left of the red line) where removed from our analyses 



Supplementary figure 10 

Figure provided separately. 

Correlation of principal components vs different confounders. 



Supplementary Figure 11 
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Identification of duplicate samples. 



We used a cut-off of 78% identity to select duplicate markers. By using this cut-off level we 
could identify all the duplicates, which we expected based on the annotations. The reason 
that we have multiple peaks above this cut-off is due to the difference in genotyping quality 
among the samples. Panel b is the enlargement of the lower part of panel a. 



Supplementary Figure 12 
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Expression of X/S7and chromosome Y genes. 

We show a clear separation of males and females using both XIST expression and 
chromosome Y expression. In two cases, the samples were annotated as male but clustered 
within the females; these are likely mis-annotations. 



